Adapting Naive Bayes to Domain Adaptation for Sentiment Analysis
نویسندگان
چکیده
In the community of sentiment analysis, supervised learning techniques have been shown to perform very well. When transferred to another domain, however, a supervised sentiment classifier often performs extremely bad. This is so-called domain-transfer problem. In this work, we attempt to attack this problem by making the maximum use of both the old-domain data and the unlabeled new-domain data. To leverage knowledge from the old-domain data, we proposed an effective measure, i.e., Frequently Co-occurring Entropy (FCE), to pick out generalizable features that occur frequently in both domains and have similar occurring probability. To gain knowledge from the newdomain data, we proposed Adapted Naïve Bayes (ANB), a weighted transfer version of Naive Bayes Classifier. The experimental results indicate that proposed approach could improve the performance of base classifier dramatically, and even provide much better performance than the transfer-learning baseline, i.e. the Naïve Bayes Transfer Classifier (NTBC).
منابع مشابه
Sentiment Analysis using Naive Bayes
Sentiment analysis is a challenging and interesting natural language processing task, if only because it naturally lends itself to domain adaptation. We study sentiment analysis using Naive Bayes and essentially reproducing the results from [1]. We start by describing the Naive Bayes model we use, then we describe the experimental setup and finally we discuss our observations and results. The N...
متن کاملAMI&ERIC: How to Learn with Naive Bayes and Prior Knowledge: an Application to Sentiment Analysis
In this paper, we describe our system that participated in SemEval-2013, Task 2.B (sentiment analysis in Twitter). Our approach consists of adapting Naive Bayes probabilities in order to take into account prior knowledge (represented in the form of a sentiment lexicon). We propose two different methods to efficiently incorporate prior knowledge. We show that our approach outperforms the classic...
متن کاملA Text Classifier with Domain Adaptation for Sentiment Classification
In sentiment classification, traditional classification algorithms cannot perform well when the number of labeled data is limited. EM-based Näıve Bayes algorithm is often employed to argument the labeled data with the unlabeled ones. However, such an approach assumes the distributions of these two sets of data are identical, which may not hold in practice and often results in inferior performan...
متن کاملLearning Latent Word Representations for Domain Adaptation using Supervised Word Clustering
Domain adaptation has been popularly studied on exploiting labeled information from a source domain to learn a prediction model in a target domain. In this paper, we develop a novel representation learning approach to address domain adaptation for text classification with automatically induced discriminative latent features, which are generalizable across domains while informative to the predic...
متن کاملCross-domain Sentiment Classification using an Adapted Naïve Bayes Approach and Features Derived from Syntax Trees
Online product reviews contain information that can assist in the decision making process of new customers looking for various products. To assist customers, supervised learning algorithms can be used to categorize the reviews as either positive or negative, if large amounts of labeled data are available. However, some domains have few or no labeled instances (i.e., reviews), yet a large number...
متن کامل